wo 2005/037853 



7 APR an 



SIDE CHAIN DEUTERATED AMINO ACIDS AND METHODS OF USE 

5 BACKGROUND OF THE INVENTION 

[0001] This application claims benefit of prior co- 

pending United States provisional application serial no. 
U.S. 60/508,886, the disclosures of which are hereby 
10 incorporated by reference iri their entirety. 

1. Technical Field 

[0002] This application relates to the field of drug 
design, and in particular to methods for obtaining high 
15 resolution NMR data for large protein systems such as 

membrane receptors and protein/protein complexes . 

2 • Description of the Background Art 

[0003] Modem drug discovery research depends heavily 
20 on knowledge of the structures of biologically active 

macromolecules . This research would benefit 
substantially from enhancements in the capabilities and 
speed of three-dimensional structural analyses of 
proteins and other macromolecules, particularly valuable 
25 targets such as membrane receptors . Although 

approximately 50% of current drugs target membrane -bound 
proteins such G-Protein Coupled Receptors, at present 
virtually no structural information is available on this 
important class of molecules. 
30 [0004] The past few years have seen rapid growth in 

methods for discovery and identification of candidate 
drugs. Genomic techniques, using rapid DNA sequencing 
methods and computer assisted homology identification, 
have enabled the rapid identification of target proteins 
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as potential drug candidates. O'Brien, Nature, 385 
(6616) :472, 1997, Once identified, a target protein can 
be produced quickly using modem recombinant technology. 
Combinatorial chemistry, wherein large numbers of 
5 chemical compounds are simultaneously synthesized on 

plastic plates, frequently by robots, has revolutionized 
the synthesis of drug candidates, where libraries 
containing tens of thousands of compounds can be 
synthesized in a few months. See Gordon et al., J". Mol. 

10 Chem., 37(10) : 1385^1401, 1994, Each member of the 

library is then "screened" for binding to one or more 
target proteins. Compounds that bind are identified, and 
similar compounds are synthesized and screened. This 
process continues in an iterative manner until a drug 

15 candidate of suitably high binding affinity is 

identified. 

[0005] Having information about the three-dimensional 
structure of a target protein allows one to design a 
"focused" combinatorial library, which mimics structures 

20 matching the potential binding region of the target 

protein, increasing the likelihood of finding potential 
drug candidates that interact with the biological 
molecule of interest. Further, having information about 
the three-dimensional structure of a protein/dimg 

25 candidate con5>lex can reveal additional details about how 

and where the binding occurs as well as strengths and 
weaknesses in the interaction and hence potential avenues 
for improving desired aspects of the binding interaction. 
[0006] Unfortunately, using commonly available 

30 methods, while genomic techniques and combinatorial 

chemistry each are perfomed in months, methods to 
determine protein structure usually take much longer. 
Therefore, there is a need in the art for methods to 
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increase the speed and accuracy of obtaining high 
resolution structures of proteins, including structures 
of proteins that are the targets of potential drug 
candidates. 

[0007] X-ray crystallography is widely used to obtain 
an estimate of the structure of proteins and. can provide 
the conqplete tertiary structure (global fold) of the 
backbone of a crystallized protein. This method, 
however, has several disadvantages. For exaitple, only 
proteins which can be crystallized may be studied using 
X-ray cjrystallography , Some proteins, such as membrane 
proteins, are very difficult or impossible to 
crystallize. Moreover, crystallization of a protein can 
be very time-consuming and expensive. In addition, to 
obtain the stinucture of a protein/drug conplex it is 
often necessary to prepare a second crystal using the 
protein and the drug, thus doubling the already difficult 
process of crystallization, a time-consuming task. 
.[0008] Another major disadvantage of X-ray 
crystal lographic data is that the structural information 
obtained may be pertinent only to the crystalline 
structure of the protein and not to the structure of the 
protein in solution. Moreover, the bond angles present 
in a crystal structure may not be the same as those of 
the protein when it is in an active conformation and 
therefore may not provide information relevant to the 
biological or physiological system of interest, therefore 
providing misleading information about the structure -of 
potential binding molecules. 

[0009] Protein structure determination by high 
resolution Nuclear Magnetic Resonance (NMR) also is well 
known. In NMR spectroscopy, magnetization of certain 
atomic nuclei (usually protons) in a powerful magnetic 
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field is detected by the absorption of radio waves. NMR 
has become a major tool in the study and analysis of 
small (<lkD) molecules. To analyze larger molecules, 
such as proteins, it is essential in most applications to 
replace the natural abundance atoms of carbon and 
nitrogen ("C and "N) universally with the NMR-active 
stable isotopes "C and to allow reliable assignment of 
each of the detected NMR signals. See Ikura et al.. 
Biochemistry 29:4659-4667, 1990; Bax, Curr. Opin. Struct. 
Biol. 4:738-744, 1994. Using isotopic labeling of this 
type, NMR has allowed workers to determine the structure 
of several proteins. See Ikura et al.. Biochemistry 
30:9216-9228, 1991; Clore and Gronenbom, Wat. Struct. 
Biol. 4:849-853, 1997. 

[00010] Early methods for determining protein 
structure using NMR used distance data derived from NOE 
(Nuclear Overhauser Effect) spectra. More recently, 
residual dipolar coupling measurements have become 
established as a method to obtain additional angular 
conformational restraints for determining the solution 
structures of proteins via high resolution multinuclear 
NMR, Tolman et al,. Proa. Natl. Acad, Sci. USA 92:9270- 
9283, 1995; Tjandra et al,, J, Am. Chem. Sac. 118:6264- 
6272, 1996; Tjandra and Bax, Science 278:1111-1114, 1997; 
Bax and Tjandra, J. Biomol. NMR 10:289-292. Methods for 
weak macromolecular alignment, such as lyo tropic dilute 
liquid-crystalline solutions, have simplified measurement 
of these couplings for a variety of macromolecules. See 
Bax and Tjandra, J". Biomol. NMR 10:289-292, 1997; 
Losonczi et al., J. Biomol. NMR 12:447-451, 1998; Prosser 
et al., *J. Am. Cham. Soc. 120:11010-11011, 1998; Clore et 
al., J. Am. Chem. Soc. 120:10571-10572, 1998; Hansen et 
al., Nat. Struct. Biol. 5:1065-1074, 1998; Kiddle and 
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Romans, FEES Lett, 436:128-130, 1998; Wang et al,, J. 
Biomol. mm 12:443-446, 1998; Ottinger and Bax, J. 
Biomol. NMR 13:187-191, 1999; Fleming et al., J. Am. 
Chem. Soc. 122:5224-5225, 2000; Ruckert and Otting, J. 
Am, Chem. Soc. 122 ill S'^ -11 SI s 2000; Mueller et al., J. 
Mol. Biol. 300:197-212, 2000; Mueller et al., J. Biomol. 
NMR 18:183-188, 2000; Fowler et al., J. Mol. Biol. 
304:447-460, 2000; Hus et al., J". Mol. Biol. 298:927-936, 
2000; Hus et al., J. Am. Chem. Soc. 123:1541-1542, 2001. 
[00011] In addition, Fesik et al. (Science 274:1531-34, 
1996) have described a screening strategy in which 
libraries are screened using NMR. In this method, cui 
isotopically labeled target protein is subjected to NMR 
in the absence and presence of drug candidate molecules 
and binding is detected from perturbations in the NMR 
spectrum. NMR also can be used to determine the 
structures of protein/ligand coitqplexes in solution. See 
Shimizu et al;, J. Am. Chem. Soc, 121:5815-5816, 1999. 
[00012] Isotopic sxibstitution in a protein usually is 
accomplished by growing a bacterium or yeast, transformed 
by genetic engineering to produce the protein of choice, 
in a growth medium containing universally ^^C-, ^^N- and/ or 
^H-labeled substrates- Many such growth media are now 
commercially available. 5ee, e.g., U.S. Patent No. 
5,324,658. In practice, bacterial growth media usually 
consist of "C-labeled glucose and/or '■^N-labeled ammonium 
salts dissolved in D2O where necessary. Kay et al., 
Science 249:411, 1990 (and references therein); Bax, J. 
Am. Chem. Soc. 115:4369, 1993. Techniques for producing 
isotopically labeled proteins and other macromolecules, 
including glycoproteins, in mammalian or insect cells 
have also been described. See U.S. Patents Nos. 
5,393,669 and 5,627,044; Weller, Biochemistry 35:8815-23, 
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1996; Lustbader, J. Biomol. NMR 7:295-304, 1996. 
[00013] In principle, NMR can provide structural data 
on drug targets such as a protein, unbound and/or 
coinplexed to a drug candidate. The actual use of NMR for 
these purposes, however, has been limited to the study of 
only relatively small proteins. Because the 
magnetization of the isotopically labeled nuclei in a 
protein (^H, "C, ^^N) tends to diffuse more easily with 
increasing molecular weight, the signal-to-noise ratio 
decreases with the size of the molecule being studied, 
rendering the data more difficult to interpret as the 
protein size increases. In essence, this is because the 
very isotopes needed to assign the protein NMR signals in 
the first place, such as "C, allow the magnetization to 
diffuse. In addition, the universal labeling yields 
split signals in the NMR spectrxim. The signals being 
assigned are split into multiplets by neighboring 
isotopes, which results in both more and weaker signals. 
This splitting further degrades the signal with respect 
to noise. Taken together, these phenomena cause 
increasing overlap of signals and decreasing signal-to- 
noise ratio with increasing molecular weight, making 
determination of structure using these methods very 
laborious and time-consuming. Each of the split signals 
need to be assigned before structure determination can be 
commenced. Therefore, in practice, these methods can 
provide fairly accurate structures only of small and 
medium sized proteins. Structure determinations of - 
proteins have been restricted to sizes of about 35kD or 
less, and for the most part only to non-membrane 
proteins. Therefore, NMR has made only a modest inqpact 
to date on drug design - 

[00014] Many attempts have been made to increase the 
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sensitivity of NMR to increase the size of proteins that 
can be studied by the technique. Deuteration of "C- and 
^^-enriched protein leads to significant narrowing of the 
carbon 13 signals relative to protonated proteins, see 
Grzesiek and Bax, J. Am. Chem. Soc. 115:4369-4370, 1993, 
however, splitting of the signals from adjacent carbon 
atoms still occurs, lowering the signal-to-noise ratio 
and increasing the number of signals. 
[00015] In another development, the introduction of 
TROSY spectroscopy, Pervushin et al., Proc. Natl. Acad. 
Sci. USA 94(23) :12366-12371, 1997, has led to the 
observation of ^^N resonances on very large molecular 
systems, provided that the molecule is fully deuterated. 
However, these ^^N signals cannot be assigned without 
related carbon resonances and even with deuteration, 
carbon resonances disappear in universally "C-labeled 
proteins at much smaller molecular sizes than ^^N 
resonances accessed via TROSY methods. This method 
therefore does not increase the size of proteins that can 
be studied using NMR. ' 

[00016] Recently, methods involving specific isotopic 
enrichment of the protein with ^^C, and in the 
backbone only have overcome some of these problems and 
dramatically increased the resolution and sensitivity of 
NMR spectra in structural studies of some proteins. 
These methods facilitate detection and assignment of the 
NMR signals and the calculation of the structure of the 
protein. (Coughlin et al., J. Am. Chem. Soc. 121:11871- 
11874, 1999; Giesen et al., J. Biomol. NMR 19:255-260, 
2001; U.S. Patents Nos. 6,111,066 and 6,376,253. 
Specifically, splitting of the signals from adjacent 
carbon 13 atoms is removed by this method because side 
chain carbon 13 atoms are laclcing. In consequence, each 
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C-a carbon appears as a single signal, and if deuterated, 
with optimum intensity. In addition, because the carbon 
13 atoms are linked to the nitrogen atoms in the 
backbone, the nitrogen signals also can be assigned. 
Therefore, all the signals from the backbone of the 
protein can be assigned, 

[00017] These enhanced assigned signals then can be 
used to calculate the global fold of the protein using, 
inter alia, the measurement of dipolar couplings. Giesen 
et al., J. Biol. NMR 25:1-9, 2002. Further, the 
screening methods such as those described by Fesik et al. 
{Science 274:1531-1534, 1996), then can be performed 
using this structural information, greatly increasing the 
value of the screening. . 

[00018] Despite these advantages, however, the method 
described above for labeling a protein in the backbone 
only suffers from some serious drawbacks. First, 
although the method allows for deuteration at the C-a 
carbon, none of the other protons in the amino acid, i.e. 
in the sidechain, are deuterated. Therefore, proteins 
labeled in the backbone only according to U.S. Patents 
Nos. 6,111,066 and 6,376,253 cannot be analyzed by TROSY- 
type spectrometry. Further, this lack of deuteration 
also causes higher rates of loss of magnetization in 
general with increasing molecular mass. Therefore, a 
need exists in the art for a method that overcomes these 
di sadvan tages . 

SUMMARY OF TBE INVENTION 

[00019] Accordingly, embodiments of this invention 
provide an amino acid wherein the sidechain of the amino 
acid is isotopically enriched with and wherein the 
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backbone of the amino acid is isotopically enriched with 
an isotope selected from the group consisting of ^^C, "N, 

and any combination thereof, with the proviso that the 
amino acid is not isotopically enriched with at every 
hydrogen* Further embodiments provide an amino acid as 
described above wherein the backbone of the amino acid is 
isotopically enriched with an isotope selected from the 
group consisting of ^^C, ^^N, and any combination 

thereof. Additional further embodiments provide an amino 
acid as described above, wherein the a-carbon proton of 
the amino acid is isotopically enriched with ^H. 
[00020] In yet further embodiments, the invention 
provides a method of synthesizing the amino acids 
described above which coitqprises obtaining glycine that 
optionally is isotopically enriched in the backbone with 
an isotope selected from the group consisting of "C, "n 
and or any combination thereof; chemically derivatizing 
the glycine; adding a deuterated side chain to the 
chemically derivatized glycine in a stereo-selective 
manner to produce a protected sidechain deuterated amino 
acid; and deprotecting the sidechain deuterated amino 
acid. In yet a further embodiment, the invention 
provides a method of synthesizing the amino acids 
described above which comprises obtaining glycine that 
optionally is isotopically enriched in the backbone with 
an isotope selected from the group consisting of "C, 
and or any coiribination thereof; ch^nically derivatizing 
the glycine; adding a deuterated side chain to the 
chemically derivatized glycine in a stereo-selective 
manner to produce a protected sidechain deuterated amino 
acid; deuterating the a-carbon of the protected sidechain 
deuterated amino acid; and deprotecting the sidechain 
deuterated amino acid. 
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[00021] In further embodiments, the invention provides 
peptidic molecules which comprise at least one amino acid 
as described above. In some embodiments, the peptide 
molecules comprise at least one species of amino acid 
wherein the side chain of each occurrence of said species 
of amino acid is isotopically enriched with ^H, wherein 
the backbone of each occurrence of said species of amino 
acid is isotopically enriched with an isotope selected 
from the group consisting of "C, and any 

combination thereof, or wherein the a-carbon proton of 
each occurrence of said species of amino acid is 
isotopically enriched with ^H. , . 
[00022] In yet further embodiments, the invention 
provides media capable of supporting the growth of cells 
in culture which comprises at least one amino acid as 
described above. 

[00023] In yet further embodiments, the invention 
provides methods of producing an isotopically labeled 
peptide molecule which comprise providing a medium as 
described above; providing a cell culture that expresses 
the peptide molecule; growing the cell culture in the 
mediiun under protein-producing conditions such that the 
cell expresses the peptide molecule in isotopically 
labeled form; and isolating the isotopically labeled 
peptide molecule from the medium. 

[00024] In yet further embodiments, the invention 
provides methods of determining structural informqition 
for a peptidic molecule which comprise producing the - 
peptidic molecule according to the method described 
above; and subjecting the peptidic molecule to nuclear 
magnetic resonance. 
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[00025] Figure 1 shows a chemical synthetic scheme for 
isotopically labeled valine, 

[00026] Figure 2 shows a chemical synthetic scheme for 
a deuterated sidecliain precursor for leucine. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



[00027] Embodiments of the invention provide means for 
increasing the resolution and sensitivity of NMR spectra 
obtained from proteins, particularly large proteins such 
as membrane receptors, etc., and therefore allow more 
detailed information regarding protein structure more 
quickly and more accurately than previously possible. 
This improvement in NMR spectroscopic techniques involves 
(1) increasing the resolution and sensitivity of key 
signals in the NMR spectrum, (2) eliminating the 
splitting of the key signals by an adjacent NMR active 
nucleus and, (3) isolating the NMR active nuclei required 
to obtain the desired information on protein global fold 
in an environment of NMR inactive nuclei . This is 
accomplished by specifically isotopically labeling at 
least one of the amino acids which make up the protein to 
be studied by NMR with any combination of ^^C, ^^N, and 
in the backbone of the amino acid, with optional labeling 
of the a-carbon proton, and also with in the side chain 
of the amino acid. ' This approach is a departure from 
current NMR labeling techniques, where the goal has been 
to prepare proteins either in a universally labeled form 
(with labeling at every position in the protein molecule) 
or labeled in the backbone of the amino acid chain only, 
avoiding side chain labeling. 

[00028] Embodiments of the invention provide an amino 
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acid that is isotopically enriched with an isotope 
selected from the group consisting of "C, "N, and or 
any combination thereof in the backbone and that also is 
isotopically enriched with in the sidechain. In other 
embodiments, the invention provides a method for 
synthesizing such aioino acids which comprises (a) 
chemically derivatizing glycine and (b) adding a 
deuterated sidechain in a stereo-selective fashion. In 
other embodiments, the invention provides methods for 
synthesizing a deuterated sidechain of amino acids which 
comprise (a) deuteration of existing imlabeled sidechain 
precxirsors or (b) assembling appropriate sidechains in 
deuterated form. 

[00029] Although the methods of this invention are 
suitcJ^le for the study by NMR of cuiy peptidic molecule of 
three or more amino acids in length, and therefore 
encompasses both proteins and peptides, the description, 
for sinqplicity, will refer only to proteins. The 
discussion therefore applies to both peptides and 
proteins, even when the term protein is used. It is 
understood that the tearms ^'protein" and "peptidic 
molecule" as used in this application, both refer to any 
peptide chain of three or greater amino acids, or, for 
example, peptides cind proteins of any length. 
Preferably, the peptidic molecule is about 5 kD or 
greater molecular weight. 

[00030] The conpositions and methods of the present 
invention therefore advantageously may be employed in 
connection with proteins having molecular masses of about 
5 kD or more, or proteins of about 50 amino acid residues 
or more. The methods are particularly useful for 
proteins of 20-30 kD or larger, which have been difficult 
to study using prior art methods, and even more 
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particularly proteins of 50 or 55 kD or more or 75 kD, or 
proteins of 100 kD or longer. Therefore any protein of 
50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150 kD or 
more, or con^lexes of such proteins, are suitable for 
structural and dynamic information determinations 
according to embodiments of this invention. The methods 
may be used to study mennbrane proteins as well. Of 
course, smaller proteins and peptides may be studied 
using the inventive methods, including oligopeptides and 
any peptide of three or greater amino acids. 
[00031] Proteins containing the specifically labeled 
amino acids may be chemically synthesized from scratch or 
expressed by cells in culture, for example by bacterial, 
yeast, mammalian or insect cells.. 

(000321 Amino acids have been chemically synthesized in 
unlabeled forms by various means. Some have been 
synthesized in specifically isotopically labeled forms 
(see, for example, Martin, Isotopes Environ. Health 
Stud., 32:15, 1996; Schmidt, Jso topes Environ. Health 
Stud., 31:161, 1995). Ragnarsson et al. {J. Chem. Soc. 
Perkin Trans. 1:2503, 1994) synthesized 1,2""C2, "N Ala, 
Phe, Leu, Tyr; 1,2-"C2, 3 ' , 3 • , 3 ' -^Hj, Ala; 1,2-"C2. 
3',3'-2H2, Phe and 3 ',3\ 3^-^11^ Ala. Ragnarsson et al. 
also synthesized 1,2-"C2, 2-2h, ^^j^^Ala, Leu and Phe and 
1,2-"C2, 2,2-2H2, ^^N Gly, which were used partly for 
conformational studies of a pentapeptide, Leu-enkephalin. 
Unkefer synthesized ^^N labeled Ala, Val, Leu, Phe as well 
as 1-^^C, Val. Other methods have been described in 
Duthaler, Tetrahedron 50:1539, 1994; Schollkopf, Topics 
Curr. Chem. 109(65), 1983; Oppolzer, Tett. Letts., 
30:6009, 1989; Helvetica Chimica Acta, 77:2363, 1994; 
Helvetica Chimica Acta 75:1965, 1992. 

[00033] More recently, methods for the preparation of 
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backbone labeled, sidechain-vinlabeled amino acids have 
been developed (see U.S. Patent No. 6,111,066). In these 
methods, stereo-selective addition of the appropriate 
amino acid sidechain was added to the isotopically 
substituted glycine derivatized in a chiral complex. 
[00034] Therefore, amino acids isotopically substituted 
(enriched) with "C, ^^N, and or any combination thereof 
in the backbone of the amino acid residue as below, and 
that also is isotopically enriched with in the side 
chain have not been available in the art. Using methods 
of this invention, such amino acids advantageously may be 
produced using asymmetric synthesis from glycine, using 
an appropriately deuterated sidechain precursor, 
Glycine, specifically labeled with any combination of "c 
and "N, is readily available commercially. Therefore it 
is preferable to synthesize the amino acids using 
glycine, isotopically labeled as required, as a 
precursor. Any other known method may be used to 
synthesize the desired glycine precursor, labeled in the 
backbone with any combination of isotopic label (s). The 
formula below indicates the backbone atoms in bold. R 
represents the amino acid side chain. Therefore, 
according to this invention, atoms in the backbone which 
may be isotopically substituted with any combination of 



^H, ^^C or 
below. The 
proton is 
isotopically 
deuterixim 
hydrogen is 
deuterium or not. 



o 

II 
c 



H 



T 

R 



NH- 



are shown in bold 
alpha- carbon 
optionally 
substituted with 
whether the amino 
substituted with 
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[00035] Preferably, however, backbone- labeled glycine 
first is converted to a nickel II transition metal 
coiiplex according to the methods of Belokon et al. (j*. 
Chem. Soc. Perkin, Trans. 1:1525-1529, 1992). The 
derivatized glycine then is alkylated by treatment with a 
base, such as sodium hydroxide, sodium methoxide or 
preferably, potassium t-butoxide, followed by addition of 
the appropriate ^H- labeled sidechain precursor. 
[00036] Commercially available ^H-labeled sidechain 
precursors, such as ^H-isopropyl iodide (for valine) or 
methyl iodide (for alanine) may be used when available, 
however, not all the sidechain precursors required to 
produce all twenty naturally-occurring amino acids are 
available commercially. The present invention therefore 
provides methods of synthesizing amino acid sidechain 
precursors or elements thereof in per-deuterated form, 
allowing any protein or peptide containing any 
combination of the twenty naturally occurring amino acids 
to be synthesized in the desired isotopically enriched 
form. 

[00037] Precursors of this type can be synthesized from 
commercially available materials. Thus, (CD3)2-CD-iodide, 
the desired precursor for specifically labeled valine, can 
be prepared from CDa-labeled methyl iodide via a Grignard 
reaction with magnesium and deuterated ethyl formate, 
followed by halogenation of the resulting specifically . 
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labeled isopropyl alcohol. The resulting iodide then can 
be used to synthesize "C, and ^H-backbone labeled ^H- 

sidechain labeled valine. See Scheme 1 (Figure 1) . 
[00038] Alternatively, deuterated alkyl side chain 
precursors can be prepared by repeatedly treating 
tmlabeled, water miscible precursors with DjO in the 
presence of platinum under high pressure, 2-hydroxy-2- 
methyl propcine is per-deuterated by fo\ir treatments with 
D2O under these conditions- The perdeutero 2-hydroxy-2- 
methyl propane then can be converted to the corresponding 
iodide by treatment with HI, or the corresponding bromide 
by treatment with phosphorus tribromide. The resulting 
halide then can be added to the glycine complex in the 
presence of base to yield protected isoleucine. 
[00039] Another suiteJDle method involves assembly of 
deuterated side chain precursors by successive additions 
of deuterated methylene groups to a deuterated precursor. 
Thus, the deuterated side chain precursor for leucine may 
be assembled as in Scheme 2 . Seie Figure 2 . A deuterated 
sulfylid (1) is formed by sequentially treating 
trimethyloxosulfonium iodide with (1) D2O in the presence 
of mild base and (2) deuterated DMSO in the presence of 
strong base such as NaH, The deuterated sulphylid then 
is added to deuterated acetone to give the epoxy- compound 
shown as compound 2 in Figure 2. Rearrangement of the 
epoxide with acid yields the aldehyde (compound 3) . 
Compound 3 either may be treated with further sulphylid 
to yield epoxide (con5)Ound 4) for further chain 
extension, or reduced with sodiiam borodeuteride to give 
the alcohol (compound 5) . Treatment of compound 5 with 
hydrogen iodide yields per^deutero l-iodo-2-methyl 
propane, which on addition to the glycine complex yields 
protected leucine. 
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[00040] Deuteration at C-a can be achieved by treatment 
of the alkylated nickel /glycine complex with MeOD in the 
presence of sodium metal. See Figure 1. On completion, 
the deuterated conplex is treated with deutero-acetic 
acid. The desired backbone- IcJ^eled, sidechain-deuterated 
amino acid may be isolated by treatment with aqueous HCl 
and ion exchange chromatography or by any convenient 
method known in the cort. 

[00041] It will be apparent to those skilled in the art 
that these methods may also be engployed to synthesize 
perdeuterated amino acids with no enrichment of ^^C and 
in the backbone by starting with unlabeled glycine. 
Incorporation of specific backbone- labeled amino acids 
(for instance, into a binding site) and backbone- 
unlabeled perdeuterated amino acids (in other locations) 
into a protein can greatly simplify NMR spectra of 
peptides and proteins, particularly proteins larger than 
20-35 kD. Another embodiment of the present invention 
therefore provides methods for synthesizing deuterated 
amino acids that are unlabeled in the backbone. 
[00042] Methods for incorporating labeled amino acids 
into proteins designed to avoid scrambling have been 
described. See Coughlin et al., J. Am. Chem. Soc. 
121:11871-11874, 1999 (specifically labeled hCG in 
manHnalian cells) and Giesen et al., J. Biomol . NMR 
19:255-260, 2001 (expression of specifically labeled 
ubiquitin in bacteria) , The disclosures with regard to 
these methods are hereby incorporated by reference in the 
present specification. 

[00043] Methods for producing isotopically enriched 
peptide or protein molecules preferably involve culturing 
cells that express the molecule in a suitable growth 
medium that contains at least one isotopically enriched 
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amino acid labeled in the backbone and deuterated in the 
sidechain as described above. Such molecules may be 
produced in an i so topically enriched form by culturing 
cells that express the protein in a suitable growth 
5 medium that contains all twenty naturally occurring amino 

acids (i.e. alanine, arginine, asparagine, aspartic acid, 
cysteine, glutamic acid, glutamine, glycine, histidine, 
isoleucine, leucine, lysine, methionine, phenylalanine, 
proline, serine, threonine, tryptophan, tyrosine and 

10 valine) , where all of these amino acids are isotopically 

enriched or where less than all twenty are isotopically 
enriched, in a sidechain deuterated form. The medi\jun« as 
will be appreciated by one of skill in the art, would 
contain the species of amino acid which are desired to be 

15 labeled in the peptidic molecule in an isotopically 

enriched form while the remaining amino acids would be 
present in natural abundance form (not enriched with any 
isotope) • 

[00044] The term "^active, " when referring to NMR-active 
20 nuclei is used according to the common usage in the art 

of NMR studies. An active isotope is visible in the 
corresponding NMR spectrxim. Natural abundance refers to 
the isotopes of an atom that occur in nature. One of 
skill in the art will recognize that atoms do not exist 
25 in a single isotope in nature, and therefore that an atom 

such as carbon, for example will exist as ^^C for the most 
part, but also will exist to a certain degree as ^^C, 
naturally. Therefore a carbon- containing molecule that 
is unlabeled nevertheless will contain a small amount of 
30 isotopes other thcin the natural abxmdance isotope ^^C as 

well. Thus, a carbon position in a molecule that is 
essentially "C contains in the same or essentially the 
same ratio (abundance) as occurs in nature. Other atoms 
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such as nitrogen and hydrogen also occur naturally as 
different isotopes and therefore the term "natural 
abundance'' may be understood with respect to any atom. 
One of skill in the art also will recognize that an 
isotopically substituted, labeled or enriched atom also 
is not 100% of the stated isotope but rather is enriched 
in the stated isotope. The term enriched' refers to an 
isotope that is present at greater than natural 
abundance, up to about 5-100%, usually about 5-20% or 
about 10-20% and most preferably about 10%. The term 
""deuterated" refers to isotopic enrichment with deuterium 
(D or ^H) . 

[00045] Proteins containing specifically labeled amino 
acids can be chemically synthesized or expressed by 
bacteria, yeast, mammalian or insect cells or in cell- 
free systems, as described by Yolcoyama et al. The 
specifically isotopically (enriched) labeled amino acids 
may be incorporated into cell medium, preferably a 
mammalian or insect cell medium, individually or in any 
combination so that the protein expressed by the cells 
growing in the medium may be specifically enriched with 
the desired isotopes at the amino acid residues or 
species of amino acids of choice. The term **species of 
amino acids" refers to a particular one of the twenty 
naturally occurring amino acid types. For example, 
lysine is a species of amino acid as are alanine, 
glutamic acid and methionine. The term is used to avoid 
confusion when attempting to distinguish between a single 
amino acid, i.e. a single residue of a peptidic molecule, 
as opposed to all instances of a single type of amino 
acid in the peptidic molecule (one specific alanine in a 
peptide versus all instances of alanine in a peptide) . 
[00046] Media for bacterial, yeast, mammalian and 
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insect cells (both primary cells and cell lines) are well 
known in the art. In general, any medium which is 
sufficient to support the growth of the cells of interest 
and to suE^ort protein expression may be used. 
Compositions of the type described in U.S. Patent Nos. 
5,324,658; 5,393,669 and 5,627,044 advantageously may be 
used for the media of this invention, if desired. 
Likewise, any cell that is capable of expressing the 
peptidic molecule is suitcible for use' with this 
invention. Methods for growing and propagating cells of 
various types are known in the art. Any suitable method 
in which the cells can express the isotopically enriched 
protein may be used with the methods and coinpositions of 
this invention. Culture conditions in which the protein 
of interest is expressed in quantities sufficient to 
isolate the material from the cell culture or medium are 
termed ^^protein-producing conditions.'' 

[00047] Persons of skill in the art are aware of many 
methods for isolating proteins and peptides from cells or 
from cell media. Any of these methods may be used 
according to this invention. Peptidic molecules, once 
isolated in isotopically enriched form can be studied 
according to known methods. Any method for subjecting 
the proteins to nuclear magnetic resonance study is 
contemplated for use with the methods and conpositions of 
the invention, but preferably multidimensional NMR 
methods such as TROSY, HNCA, HNCOCA, HNCO and HNCACO are 
employed . 

EXAMPLES 

Example 1. Synthesis of L-(^^C^,"N> 50%-^H-backbone) - 

sidechain-U-^ valine. 



-20- 



wo 2005/037853 



PCT/US2004/a32941 



[00048] BPB-Ni(II)-( "C2, "N) -Glycine red conplex (1 g, 
0.97 nunol, 1,00 equiv,) was suspended in anhydrous CH3CN 
(20 itiL) at room temperature. NaO^Bu (0.2 g, 2.1 mmol,. 
1-05 equiv.) was added to the red reaction suspension 
followed after 5 minutes by (CD3) -CD- iodide (0.21 ml, 2,1 
mmol, 1.05 equiv.) in anhydrous CH3CN (10 mL) . After 4 
hours, thin layer chromatography (silica gel, 
acetone/CHClj =1/5) revealed the presence of a trace of 
unreacted starting material and a major spot with a 
higher Rf value. Glacial acetic acid (0.24 mL, d=1.049, 
63.95 mmol, 4.41 equiv.) was added to quench the 
reaction. 

[00049] The reaction mixture was concentrated under 
reduced pressure and extracted with CH2CI2 (3 x 50 mL) . 
The combined organic layers were washed with H2O (2 x 50 
mL) and then brine solution (50 inL) . The organic phase 
was dried (MgS04) and evaporated to provide a red crude 
foamy glass. The crude product was subjected to further 
purification by flash column chrcxnatography on silica gel 
using chloroform: acetone as eluant. The approporiate 
fractions were combined and evaporated to dryness to 
provide BPB-Ni (II) - (^^C2, ^^-^H-backbone) -sidechain-U-^H 
valine. 

[00050] One half of the residue was dissolved in MeOD 
(Isotec, 26 mL) , treated with sodium metal (92 mg) and 
the whole heated to refl\ix overnight. On cooling, the 
reaction mixture was treated with deutero-acetic acid 
(CIL, 1.4ml) and concentrated under reduced pressure. 
The mixture was extracted with CH2CI2 (3 x 50 mL) and the 
combined organic layers were washed with H2O (2 x 50 nlL) 
and then brine solution (20 mL) . The organic phase was 
dried (MgS04) and evaporated. The resulting red foamy 
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glass was dissolved in methylene chloride (10ml) and 
added dropwise to stirred hexane (2L) . The suspension 
was stirred overnight, filtered and the collected solid 
dried to provide BPB-Ni{II)- ("Cj/^N, -^H-backbone) - 
sidechain-U-^H valine. 

[00051] A 1:1 mixture of BPB-Ni (II) - ("Cj, ^^N, -^H- 
backbone) -sidechain-U-^H valine and BPB-Ni (ll) - ("Cj/^N, - 
2H-backbone)-sidechain-U-2H valine, CH3OH (60 mL) , in 2 M 
HCl (60 mL) were heated at reflux for 10 minutes. The 
pale gre^n solution was evaporated to dryness on a rotary 
evaporator- HjO (50 mL) was added to the dried solid. 
The mixture was cooled in an ice bath for several hotirs. 
Filtration of the mixture gave BPB.HCl. The filtrate was 
dried and the title confound isolated by ion exchange 
chromatography and crystallized from aqueous ethanol as 
("C2/"N, 50%-2H-backbone)-sidechain-U-2H valine, (M/S 
contains molecular ions at m/z 128 and 129) , 

Exanple 2. Svnthesis of perdeutero-l-iodo-2- 

methvlorooane . 

[00052] Trimethyloxosulfonium iodide (110 g, 0,5 mmol) 
was dissolved in hot D2O (500 ml). Potassium carbonate 
was added and the solution heated to 70-90OC for one 
hour, then cooled to approximately O^C for one to two 
days. The resulting solid was filtered and the process 
repeated twice to yield perdeutero-trimethyloxosulfonium 
iodide (yield, 76. 5g, 66.8%. M/S contains molecular ion 
at m/z 102) . 

[00053] Sodium hydride (6 g) was placed in a 500 mL 
flask and washed with petroleum ether by stirring and 
decanting. Residual ether was removed under reduced 
pressure, d6-dimethyl sulfoxide (150 mL) was added and 
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the suspension heated to 60-70*^C until effervescence had 
ceased- The mixture was cooled with cold H2O, 
Perdeutero-trimethyloxosulfonium iodide (57.29g, 250 
inmol) was added and the mixture stirred for 15 minutes, 
de-acetone (12,8 mL, 200 mmol) then was added. The 
mixture was stirred at room temperature for 30 minutes 
and then heated to 40-45^C for 30 minutes. After cooling 
and stirring at room temperature for a further hour, the 
flask was fitted with a distillation adapter, condenser 
and a receiver flask cooled to -70^C. The system was 
placed under water aspirator vacuiun and the reaction 
flask heated to 50*^0 to isolate perdeutero- 
methylpropylene oxide (11,7 g, 75%). 

[00054] Perdeutero-methylpropylene oxide (16.44 g, 205 
mmol) was- cooled in an ice bath and treated with DCl in 
D2O (100ml) and the whole refluxed for 18 hours. The 
reflux condenser was replaced by a distillation apparatus 
and perdeutero-isobutyraldehyde isolated by distillation. 
Yield, 9.65 g (58%) . 

[00055] Perdeutero-isobutyraldehyde (9.65 g, 120 mmol) 
was suspended in DjO and cooled in an ice bath. Sodium 
borodeuteride (5 g, 120 mmol) was added in portions over 
a 10 minute period. The mixture was stirred for 1 hour 
and then sodium chloride (approximately 12 g) was added. 
The mixture was extracted with diethyl ether (4 x 50 mL) 
and the organic extracts dried (sodium sulfate) and 
distilled through a column containing glass helices to 
give perdeutero-isobutyl alcohol as colorless liquid Jbp 
105-108OC; yield, 8.7 g (88%). 

[00056] Perdeutero-isobutyl alcohol (8,7 g, 105 mmol) 
was stirred in an ice bath while hydroiodic acid (50 mL) 
was slowly added. The mixture was then heated in an oil 
bath and slowly distilled. Crude perdeutero-isobutyl . 
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iodide was isolated {bp 80-98®C) . Water was removed with 
a burette and treatment with sodium sulfate, followed by- 
filtration . Color was removed by treatment with sodixim 
metabisulf ite and filtration to yield pure per deutero-1- 
iodo-2 -methylpropane . 

Exaitple 3 . Expression and Purification of Serine 

Hvdroxvinethvl transferase (SIIMT) containing 
Backbone-^^C^, ^H-Sidechain ^R^-L- 

valine, 

[00057] A 20 niL stock culture of M15 cells transformed 
with the vector pqe30 SHMT was used to inoculate 1 L of 
medium containing 500 mg alanine, 400 mg arginine, 400 mg 
aspartic acid, 50 mg cystreine, 400 mg glutamine, 650 mg 
glutamic acid, 550 mg glycine, 100 mg histidine, 230 mg 
isoleucine, 230 mg leucine, 420 mg lysine HCl, 250 mg 
methionine, 130 mg phenylalanine, 100 mg proline, 2.1 g 
serine, 230 mg threonine, 170 mg tyrosine, 230 mg valine, 
500 mg adenine, 650 mg guanosine, 200 mg thymine, 500 mg 
uracil, 200 mg cytosine, 1.5 g sodium acetate 
(anhydrous), 1,5 g succinic acid, 750 mg NH4CI, 850 mg 
NaOH, 10.5 g K2HPO4 (anhydrous), 2 mg CaClz 2H2O, 2 mg 
ZnS04 7H2O, 2 mg MnS04 HjO, 50 mg tryptophan, 50 mg 
thiamine, 50 mg niacin, 1 mg biotin, 20 g glucose, 4 mL 1 
M MgS04, 1 mL 0*01 M FeClj, 15 mg ampicillin, cind 50 mg 
kanamycin. 

[00058] When cell density had reached an OD of 1.2, the 
cells were harvested by centrifugation, rinsed with PBS, 
recentrifuged and resuspended in a medium of the above 
proportions but in which backbone -^^C2# ^H, -sidechain- 

^H7-Ii- valine was substituted for the iinlabeled valine. 
After 30 minutes, protein egression was induced by 
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addition of IPTG to a final concentration of 0.1 mmol. 
After 6 hours, the cultured cells were centrifuged at 
4000 rpm for 20 minutes in a Sorvall RC-3B centrifuge. 
The cell pellet was then stored at -20 °C overnight. The 
cells were thawed and resuspended in 30 ml of sonication 
buffer (50 mM sodium phosphate, 500 mM NaCl, pH=8.0). 
The cells were broken by passing them through a French 
Press four times at 20,000 psi. The broken cells were 
subjected to sedimentation at 15,000 x g for 20 minutes 
in Oakridge tubes. 

[00059] A 5 ml Ni-NTA immobilized metal affinity coltimn 
was equilibrated in sonication buffer at 5 mL/min. The 
supernatant (cell lysate) was removed from the Oakridge 
tube without disturbing the pellet* Cleared lysate was 
loaded onto the column at 5 mL/min. The column flow- 
through was saved for later analysis. The material bound 
to the column then was washed with sonication buffer for 
30 minutes until the absorbence of the effluent was less 
than 0.020- Boiind protein was eluted from the column 
with Elution Buffer (50 noM sodium phosphate, 500 mM NaCl, 
500 mM imidizole, pH=8lO) using a single step. The peak 
fraction was collected manually. A sample of the elution 
was saved for analysis. 

[00060] A 300 mL XK-50 column packed with Sephadex G-25 
Fine size exclusion chroDKitography (SEC) resin was 
equilibrated with anion exchange Buffer A (20 mM Tris 
HCl, pH = 7.5) . The Ni-NTA eluate was loaded onto the 
SEC column at 15 mL/min. The protein peak was collected 
manually. The protein sanqple, now in anion exchange 
Buffer A, was stored at 4®C during preparation of the 
next step. A 10 mL Resource Q anion exchcuige column was 
equilibrated in Buffer A. The partially purified protein 
was loaded onto the coliutm at 10 mL/min. The sanple was 
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washed with Buffer A for three minutes. The sample was 
eluted with a linear gradient into Buffer B (20 mM Tris 
HCl, 1 M NaCl, pH=7.5) over seven minutes. The fractions 
were collected in 30 second intervals. A sample of each 
fraction was set aside for analysis. 

[00061] The material was analyzed by SDS-PAGE using a 
12% Tris /Glycine gel at a constant 200 volts for 45 
minutes. The pure fractions were loaded into a 3000 MWCO 
Slidalyzer TM dialysis cassette. The protein was 
dialyzed at 4^C into 50 raM sodium phosphate pH 7.0. Two 
buffer changes ensured complete removal of the Tris 
buffer. The final protein concentration was determined 
using UV absorbcuice at 280 nm; coiiparing it to the 
extinction coefficient for MUP (0.503 at 1 mg/mL) . The 
final concentration of pure Serine Hydroxymethyl 
Transferase was 48 mg/mL in 1.9 mL. Total yield was 90 
mg. The final SHMT sample was stored at 4<^C prior to NMR 
analysis . 

Excunple 4. NMR Analysis of Serine Hvdroxvmethvl 

Transferase SHMT) Containing Backbone-"C-^ . 
^N. ^H-Sidechain ^H.-L- valine. 

[00062] A 15 mg sample of backbone-^Cj, ^^N, ^H,- 
sidechain-^H-^-L-valine labeled MUP was dissolved in 650 yL 
phosphate buffered saline (10 mM potassium phosphate; 200 
inM sodiuun chloride) , to which was added 50 pL deuterium 
oxide. A three-dimensional HNCA spectrum was acquired 
according to known methods. 
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